General Disclaimer 


One or more of the Following Statements may affect this Document 


• This document has been reproduced from the best copy furnished by the 
organizational source. It is being released in the interest of making available as 
much information as possible. 


• This document may contain data, which exceeds the sheet parameters. It was 
furnished in this condition by the organizational source and is the best copy 
available. 


• This document may contain tone-on-tone or color graphs, charts and/or pictures, 
which have been reproduced in black and white. 


• This document is paginated as submitted by the original source. 


• Portions of this document are not fully legible due to the historical nature of some 
of the material. However, it is the best reproduction available from the original 
submission. 


Produced by the NASA Center for Aerospace Information (CASI) 





ELECTRON LITHOGRAPHY STAR 
DESIGN GUIDELINES 
PART III of IV: The Mosaic Transistor Array 
Applied to Custom Microprocessors 

PART IV of IV: Stored Logic Arrays - SLAs 
Implemented with Clocked CMOS 


g.nt>n7 


(HAS A-CB - 1 707#>7) ELECTRON LITHOGRAPHY STAB Mfc * — j 1 e*J2 

SrSIGN GUIDELINES. PI 51 3: Iht HOSAIC 

TRANSISTOR ARRAY A££L1EC TC CCSTOfl 

flICRC PROCESSORS. PART Ui S1CRE5 LOGIC UncldS 

((Ussissj-ppi State Univ. , Bississippi G3/60 16022 


ENGINEERING S. INDUbmiJTT Wl 


ELECTRICAL ENGINEERING 


[ARCH STATION 


FINAL REPORT - PARTS III & IV 


Submitted to: 

National Aeronautics and Space Administration 
George C. Marshall Space Flight Center 

Principal Ivestigator 



ir> -J 

s -s 

k - 1 


Date of general rs lease 


Mississippi State, MS 39762 


MSSU-EIRS-EE-83-4 






** ^ ^ . - 


Unclassified 

4ECU *ITY CLASSIFICATION OF This PAGE (Whon Data f nt trod) 

f REPORT DOCUMENTATION PAHS 


P 


REPORT NUMIFP 

MSSU-EIRS-EE-83-4 


2 . gov; accession no. 


4. Title (and Sub*' t la) 

Electron Lithography STAR Design Guidelines 
Part 3: The Mosaic Transistor Array Applied 
to Custom Microprocessors 
Part 4: Stored Logic Arrays - SLAs 
7. AuTMoar#; 

J. Donald Trotter, Principal Investigator 


READ INSTRUCTIONS 
BEFORE COMPLETING FORM 
recipient's catalog number 


I. TYPE op REPORT a PERlOO COVERED 

Final Report 
Parts 3 and 4 of 4 

•. performing org. pepopt number 


• contract op grant number^ 


NAS8-33450 


? PERFORMING organization name ano ADDRESS 

Mississippi State University 
Drawer EE 

Mississippi State, MS 39762 

n. controlling office name ano aooress 

| NASA/MSFC 

MSFC, AL 35812 

ATTN: Harley R. Hope AP29-C 


10. PROGRAM element, project, T* < 

AREA A PORK UNIT NUMBERS 


12. REPORT DATE 


August 31, 1982 

17. NUMBER OF PAGES 

73 


MONITORING AGENCY NAME a AOORESS^// dtf'aront from Controlling Otfl co) IS. SECURITY CLASS, (ot this ropori) 


ONRRR, Georgia Institute of Technology 
206 O'Keefe Building 
Atlanta, GA 39332 


Unclassified 


IS*. OECLASSi F| CATION' DOWNGRADING 
SCHEDULE 


1 14 . distribution statement rot uua Report) 


| 17. DISTRIBUTION ST ATEMENT (ot tho abstract ontorod In Blozk 20, It dlttoronl from Ropori) 


1 9. SUPPLEMENTARY notes 


C.COJt'-JAV r.Vli ' ; 

OF POOR QUALITY 


19. KEY WOROS (Continue on rororao aide 1C nocoeoary mid Identity by block numbor) 

Fart 3: STAR, Semicustom microprocessor. Data path 

! Part 4: Stored Logic Array (SLA), Programmable Logic Array (PLA) , 

j Clocked CMOS, STAR, Mosaic Transistor Array (MTA) 

20. ABSTRACT f Contlnuo on reveree aide It nacaaaary and Idmntlty by block numbor) 

Part 3: The Mosaic Transistor Array is an extension of the STAR system 

developed by NASA which has dedicated field cells designed to be 
specifically used in semicustom microprocessor applications. The basic 
loaic functions for a data path are designed with compatible interface 
to the STAR grid system. 

Part 4: Stored logic arrays are folded PLAs with the AND and OR planes 

merged into one physical space. The structure is shown to be compatible 


DO , 1473 EDITION OP 1 NOV «1 IS OBSOLETE 

S/N 0102-LF-0144601 


SECURITY CLASSIFICATION OF THIS PAGE (When Dmia Bnterodj 



FINAL REPORT 
CONTRACT NAS8-33450 

ELECTRON LITHOGRAPHY STAR DESIGN GUIDELINES 
Part 3: THE MOSAIC TRANSISTOR ARRAY APPLIED TO CUSTOM 
MICROPROCESSORS 

Part 4s STORED LOGIC ARRAYS - SLAs IMPLEMENTED WITH 

CLOCKED CMOS 


Principal Investigator 
J. Donald Trotter 


Mississippi State University 
Department of Electrical Engineering 
Mississippi State, Mississippi 39762 


August 31, 1982 


for 

NATIONAL AERONAUTICS AND SPACE ADMINISTRATION 
Marshall Space Flight Center, AL 35812 


ELECTRON LITHOGRAPHY STAR DESIGN GUIDELINES 


Part 3: The Mosaic Transistor Array Applied 
to Custom Microprocessors 



j 

Principal Investigator 

J. Donald Trotter i 

| 3 


i 

I 

i 

* 

i 





THE MOSAIC TRANSISTOR ARRAY 
APPLIED TO 

CUSTOM MICROPROCESSORS 


Out lint 

I. Intr oduot i on 1 

II. The Sandia CMOS Prooess with Double Layer Metal 3 

III. The Design Approach 12 

IV. The Data Path Circuitry 17 

V. Additional Comments 29 

VI . Aooend i x 30 


LIST OF FIGURES 


Fiflurt 

i 

Frocftis Cross-Stot ions 

6,7 

F i qur • 

2ft 

CMOS Invftrtftr 

14 

F i our t 

2b 

Thrift Tyoftft of Switch** 

14 

F i our • 

3 

A CMOS Lfttoh 

15 

F i qur ft 

4 

Block Diagram of Lfttoh 

16 

Fi qur ft 

5 

Lfttch with Multiplft Input* 

16 

F i gur ft 

6 

Chip Plftn for Dfttft Pftth 

19 

Fi qur ft 

7 

Gftnft r ft 1 i * • d Funot ionil Blook 

20 

F i gur • 

8 

Block Diftgram for ft 1 Bit ALU 

21 

Fi qur • 

9 

Cont r ol Buff ftr* 

22 

F i gu r ft 

10 

Two Port Rftoi*t#r 

23 

Fi qur ft 

1 1 

A 4x4 Barr «1 Shi f tor 

24 

F i gu r ft 

1 2 

Bus Cirouitry and Timing 

25 

F i qur • 

1 3 

Litoral Buffer 

26 

F i gu r • 

1 4 

Input /Output Latchft* 

27 

Fi qur • 

1 5 

Tr i -St a t * I/O 

28 


iii 




ORIGINAL PAGK IS 
OF POOR QUALITY 


SUMMARY 


The NomIo Transistor Array is an extension of the 
STAR systftn developed by NASA which has dedicated field 
o« 1 1 s designed to bs spooifioally used in semioustom 
nioroprooftiior applications. Th# Sandia radiation hard bulk 
CKOS process is utilised in ord#r to satisfy th# 

r ft qu i r emen t s of spao# flights. A design philosophy is 
developed which utilises the strengths and recognises the 
weaknesses of the Sandia prooess. A style of oircuitry is 
developed which incorporates the low power and high drive 
capability of CKOS. In addition the density achieved is 
better than that for classic CMOS, although not as good as 
for NMOS. 

The basio logic functions for a data path are 
designed with compatible interface to the STAR grid system. 
In this manner either random logio or FLA type structures 
can be utilised for the control logic. 


v 


ORIGINAL RAGS 13 
OF POOR QUALITY 


THE MOSAIC TRANSISTOR ARRAY 
APPLIED TO 

CUSTOM MICROPROCESSORS 


X. Introduction 


NASA ot Marshall Spaoe Flight Center has developed 
t ha Standard Transistor Array (STAR) as a moans of providing 
ouiok turn-around to tho dos i on- f abr i oa t i on oyolo for oustom 
intogratod oirouits. It is in ossonoo a somioustom approaoh 
utilising two lovols of motal intorconnoot for oustomising a 
ohip to an application. It orovidos a moans for fabrication 
of tho diffusod undo r s t r uo t ur o at ono location whilo tho 
oustomising doublo-layor motal can bo appliod at anothor . 
This allows oroanisat ions with profioionoy in tho thin film 
hybrid fiold to placo tho turn-around of oustom I. C. 
development under their own oontrol. 

Tho associated STAR software allows for any mix of 
automatic layout and hand layout desired. Since tho 

int oroonnoot ions are regimented into a vo r t i o a 1 -ho r i s on t a 1 
format, a reasonable display of tho ohip design oan bo 
mapped onto tho column-row oriented lino printer output. Tho 
STAR software system therefore does not demand an 
interactive graphics capability as a prerequisite for 
efficient operation. 

Due to the double-layer metal aspeot , STAR provides 
higher density than former automatio layout sohemes whioh 
utilise only one layer of metal. The ohip unde r s t r uo t ur e is 
paoked with transistors and the interconnections are handled 
overhead in the metal layers. The STAR scheme oalls for all 
metal to be routed over a rigid grid structure. The 

interconnect points of the under s t rue t ur e transistors to the 
metalisation layer, is., the grid points, are also fixed. 
The nature of the layout and fabrication of the 
unde r s t r uo t ur e oan be varied in all other aspects. Thus, 
the STAR aoproaoh oan be suited to variations of technology. 
The intent is to separate the ohip logio (in terms of 
interconnections of grid points) from the technology (in 
terms of solid state devioes which feed through up to the 
grid). Logio, onoe defined, oan easily be transferred from 
one technology to another or scaled up or down as desired. 
The under st ruoture consists for the most part of two 
complementary transistors that are replicated over the whole 
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chip. The task of ohanging technologies is reduced to 
redesigning two transistors. The eooommoda t i on within a 
technology to a speoifio vendor's desires on design rules 
can also be handled easily by redesigning the two 
t r ans is t or s . 

The STAR format dedioates certain of the horisontal 
and vertioal lanes to the task of oonverting transistors 
into standard logic cell designs in order to ease the burden 
on automatlo layout computation. This leaves dear lanes, 
plus unused oell lane segments, for ths global 

in tsroonneot ion of oella into ths final ohip design. STAR 
provides an sxoellent solution for ouetom integrated olrouit 
needs in the area of random logic. There is a considerable 
need for suoh oapabilitv in the interfacing of a standard 
of f - 1 he-sht 1 f mi or opr ooessor s and their associated chips out 
to the real world. The STAR approach is not area effiolant, 
however, in aotual development of new microprocessors or 
similar ohips. 

The Mosaic Transistor Array (MTA), the subjeot of 
this research effort/ is an approaoh aimed at reduolng the 
turn-around time in the development of mi or opr ooessor or 
similar Chios. Suoh chips tend to have segments devoted to 
RAM, ROM, FLA, and register activities. The STAR format, 
whioh is optimised for random logio, provides mors 

interoonneot lanes than are needed for these very regular 
struotures. The MTA provides a small variety of 
under st ruotures to acoommodat# eaoh suoh aotivity. These 
underst ruotur es or field oells still maintain the same grid 
for compatibility with the random logio field oell, but the 

density of transistors may inorease by a faotor of four. 

The MTA loses the semioustom capability of the STAR. 
Prudent inclusion of extra random logio and/or FLA field 

cells into a design provides elbow room for mistakes or 
changes in oontrol section. In oonoept one only has to 
revise ths metal level masks. Wafers having the previous 

under st ruo ture proosssing would still be good for ths new 
ohip logio design. 

A requirement for “total dose" radiation hardness 
constrains the design to bs compatible with an available 
“hard" prooess. The Sandia silicon gate bulk CMOS prooess 

has bsen seleoted. This proosss prssents interesting 

oonseguenoss for VLSI typs designs. Channeling over the 
p-we 1 1 is minimised by requiring the poly gate to extend 

over thin oxide into the F+ guard ring. This oonstraint 
oompromisss the normal CMOS oirouit density -- much less 
compared with NMOS mi or opr ocsssor s . 

The goal of this rssearoh projeot is to develop at 
least a first generation set of oells suitable for 

implementing the baeio computer functions. In order to 
aohieve flexibility for various appl ioat ions , a oustomised 
oontrol strueture is visualised, possibly through 
mi oropr ogr amming . In addition an organised approaoh to the 
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assembly of t ho svstem building blooko from both eirouit and 
topology points of view is rtquirtd. A generalised data 
path (bit-slioe) struoturo with a finite-state stohint 
oontrci l«r offors tuoh an aporoaoh. Furthermore, tho data 
flow through tho bit-slioe oan bo optimised for porf oraaneo. 


II. Tho Sandia CMOS Proooss with Doublo-Layor Mstal 


Tho Sandia proooss has boon ohoson slnoo it 
roprosonts a bulk ailicon, r adiat ion-hard proooss suitabio 
for 1 ow-powor spaoo applications. Double layor metal add-on 
prooossing is asssuood for this design -- just as in tho 
oaso for tho STAR program. Tho Sandia proooss is a p -wo 1 1 
proooss with a separate F+ guard-ring diffusion. 
Consequent 1 y , tho all N+ poly 1 n t e r oonneo t 1 on / g a t e layor is 
permitted to oross tho field and p -wo 1 1 boundary. Tho 
fabrioation stops required for n-type and p-type devioes in 
tho Sandia proooss are osplainod with tho aid of a series of 
or oss-seo t i ona 1 drawings in Figure 1. 

Tho fabrioation proooss begins with tho growth of a 
thermal oxide on an n-type silioon wafer. Figure la shows 
tho orosi-sootion of tho wafer after tho thermal oxide has 
boon grown. Tho next sot of stops develops tho p-well whioh 
is noooss ary for tho isolation of tho NMOS devioes. In 
order to oroato tho p-well, a hole must be made in tho 
thermal oxide; this is done by a masking proooss. Tho 
p-well mask allows tho thermal oxide to be removed in 
regions whore a p-well is desired. The p-well is implanted 
and driven-in. A thin oxide is grown over tho well in 
oonlunot ion with the drivo-in. The end result is indioated 
in Figure 1 -b . 

Tho F+ guard ring is the next addition. This guard 
ring serves to prevent tho formation of a n-type channel 
over the lightly doped p-well -- even with tho trapped 
positive ohargos in tho oxide duo to high doses of 

radiation. It also servos to inhibit SCR latch-up. Tho 
Dlaoement of tho guard ring is controlled by a mask. Figure 
l-o displays the wafer after a boron diffusion has produced 
tho P+ guard ring. Tho guard ring mask (photoresist), 
indioated by tho region enclosing the wavy linos, is shown 
in the f i gore . 

A thiok oxide, referred to as tho field oxide, is 
then grown over the entire wafer. A mask is used to remove 
tho thiok oxide in regions whore a thin oxide is desired. 
Figure 1-d illustrates the results. 

Tho philosophy of tho Sandia process is to have a 
thin oxide oover the entire p-well, including tho guard ring 
and the FMOS devioes. Tho other regions are ooverod by a 
thiok oxide. It should be noted that the exposure to 

radiation oreates ho 1 e-el eo t r on pairs in the or ids. The 
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electrons sovft relatively freely with aodtnti «UoUlo 
fields and Art drained o If by the positive potential 
in t or oonneot 1 one . The hoiee ere trapped at the oaide 
interface, resulting in an effective aooumulation of 
positive oharpe whioh attraete eleotronio charge in the 
eilioon below. The charge trapoed ie related to the oxide 
thickness to the second power, as a minimum . Consequently, 
over 1 i ght 1 y-doped p-regione, only thin gate oaide is used 
in order to prevent channeling between "unrelated” N+ nodes. 

The thin oaide is provided by growing an oxide ever 
the entire wafer. This oxide growth does not appreciably 
change the thickness of the thiok oxide already present. A 
uniform polysilicon <poly> deposition follows the thin oxide 
growth. The poly is doped N+ and will be used as the gates 
for all the transistors, both MMOS and FMOS. A mask 
referred to as the poly mask is used to remove the unwanted 
ooly. The poly that remains is oovered by growing a layer 
of thin oxide. Figure i-e shows the oross-seotion of the 
wafer at this point in the fabrication prooess. 

The wafer is now ready to be implanted with an N+ 
material whioh will provide the souroes and drains of all 
the NMOS devices. The N+ lmolant mask (photoresist) is 
plaoed on the chip and the N+ material Implanted. The 
results from this N+ implant step are indicated in Figure 
1-f. The mask is still in plaoe as indioated by wavy lines. 
It is obvious from the figure that the placement of the 
souroe and drain regions in the p-well for the NMOS devioes 
is controlled by the implant mask and the poly. This 
provides a se 1 f - a 1 i gnmen t of the NMOS transistors whioh is 
vital for proper operation of the device. One notes that a 
He diffused region oan be plaoed over the n-field in order 
to achieve ohmio oontact to the n-substrate. The N+ implant 
mask must fall within the thin-oxlde region. 

Similar to the KMOS devioes, the FMOS devices are 
also se 1 f -a 1 1 gned . The F+ Implant mask is used with a boron 
implant to dope the F+ regions. Figure 1-g illustrates the 
results. From the figure it is seen that the placement of 
the souroe and drain regions of the FMOS devices is 
controlled by the thick oxide and the poly. The F+ implant 
mask does not allow the F+ material to be implanted inside 
the p-we 1 1 exoept in regions for ohmic contaot to the 

p -wt 1 1 . 

The removal of the F+ implant mask is followed by a 
glass ooating. The cross-section of the wafer after this 
step appears as in Figure 1-h. It is seen in the figure 
that once the NMOS and FMOS devioes have been oonstrueted it 
remains only to provide the metal ini er conneo t i one between 
devices. 

As mentioned previously, double-layer metal 
processing is assumed. Before the first layer of metal is 
deposited on the ohip, holes are out into the gla&s that is 
present on the chip. This is done through another masking 
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■ t«p; the meek ie t h i * eese referred to n the oontict meek, 
figure i-i illuetrt.ee the ohip after thie masking etep. 
The holes produced in the gUee are oalled contacts# and the 
o*:ip is ready for the firet layer of metal. 

The first layer of metal# sometimes a silieide, is 
deposited on the chip. A mask is used to remove the metal 
in looations where it is unwanted. This technique was used 
when the poly was laid on the ohip and will be used again 
for the seoond layer of metal. After the first metal 
processing steps, the ohip appears as in Figure i-f . The 
metal is indicated in the figure by the orore-het chad area. 
In the figure the drain of the NMOB device has been 
conneoted to the drain of the FMOft devioe. Other 

connections between rfevioes oan be made by use of the metal. 

The first metallisation is followed by another glass 
ooating similar to the glass ooating step whioh followed the 
F+ Implant. Before connections are made using the second 
metallisation# holes are out in the glass. Again a mask 
<oalled a via mask) is used. The resulting holes in the 

glass are known as vias. The or oss-seo t i on of the chip 
after the vias have been etched is indioated in Figure 1-k. 

The seoond layer of metal# appropriately referred to 
as top metal# is deposited on the chip. Onoe again a mask 
is used during the etohing prooess to remove undesirable top 
metal. This seoond me t s 1 1 i i a t i on step provides the 

remaining devioe oonneotions that oould not be completed 
with first metal. A final glass overcoat is plaeed on the 
chip as shown in Figure 1-1. The top metal has been 

cross-hat ohed opposite to that of first metal. According to 
the figure the oonneotion of the gates of the NMOft and FMOft 
devices has been made by the top metal. This second 

metallisation completes the prooess esoept tor a pad mask 
step that allows terminal oonneotions of the paoxaged ohip 
to the integrated circuit. 

The Sandia prooees oan be auiokly summarised by 
reviewing the different masks rsguired during fabrication. 
Ten masks saoluding ths final pad mask oompriss the oomplets 
set of required masks. Thes# masks are in order: p-well, F+ 

guard ring# thin oxide# poly gate# N+ implant# F+ implant# 
oontaot# first metal# via# and seoond (top) metal. 

The design rulss used for tho MTA project are ths 
same ae those developed for the radiation-hard STAR design. 
These rules art rspsatsd in ths following Tabls for the 
reader's oonvenienco. 
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NASA 

SANDIA 

MASK 

DESCRIPTION 

(MILS) 

(MICRONS) 

LAYER 1: 

P-WELL 



1.1.1 

MINIMUM WIDTH 

.2 

5 

1.1.2 

MINIMUM SPACE 

1.0 

25 

LAYER 2: 

P+ CUARD-BAND 



2.2.1 

MINIMUM WIDTH 

.2 

5 

2.2.2 

MINIMUM SPACE 

.6 

15 

2.2.3 

MINIMUM WIDTH AROUND WELL 

.3 

7 

2.1.1 

INSIDE EDGE OF GUARD BAND 
OVERLAP TO OUTSIDE OF P-WELL 

0 

0 

LAYER 3: 

THICK OXIDE (ALIGNED TO P+ GUARD-BAND) 



3.3.1 

MINIMUM THICK OXIDE WIDTH 

.3 

7 

3.3.2 

MINIMUM THIN OXIDE SPACE 

.3 

8 

3.3.3 

MINIMUM SPACE (P+ TO P+) 

.5 

12 

3.2.1 

MINIMUM THIN OXIDE OVERLAP 
OF GUARD BAND EDGE 

.2 

5 

3.2.2 

MINIMUM THICK OXIDE TO 
GUARD BAND WHEN USED TO 
DEFINE N+ 

.325 

8 

3.2.3 

MINIMUM THICK OXIDE TO 
GUARD BAND WHEN USED TO 
DEFINE P+ DRAIN 

.65 

17 

3.2.4 

SAME AS 3.2.3 EXCEPT 
P+ SOURCE 

.5 

13 

3.1.1 

MINIMUM THICK OXIDE TO 
P-WELL WHEN USED TO 
DEFINE P+ DRAIN 

in 

00 

• 

21 

3.1.2 

SAME AS 3.1.1 EXCEPT 
P+ SOURCE 

.675 

17 

3.3.3 

MINIMUM THIN OXIDE TO 
SCRIBE LINE 

2.0 

50 


Table la - NASA Design Rules For Sandia Process 
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i-lASK 

DESCRIPTION 

NASA 

(MILS) 

SANDIA 

(MICRONS) 

LAYER 4: 

POLY SILICON (ALIGNED TO THICK OXIDE) 



4.4.1 

MINIMUM LINE 

.25 

6 

4.4.2 

MINIMUM SPACE 

.25 

6 

4.4.2 

MINIMUM PMOS GATE LENGTH 

.25 

5 

4.4.3 

MINIMUM NMOS GATE LENGTH 

.2 

8 

4.3.1 

MINIMUM POLY TO THICK 




OXIDE 

.075 

1 

4.2.1 

MINIMUM POLY TO GUARD BAND 




WHEN POLY DEFINES N+ 

.35 

9 

4.2.2 

SAME AS 4.2.1 EXCEPT POLY 




DEFINES P+ DRAIN 

.7 

18 

4.2.3 

SAME AS 4.2.2 EXCEPT POLY 




DEFINES P+ SOURCE 

.6 

14 

4.3.2 

MINIMUM POLY GATE OVERLAP 




OF THICK OXIDE 

.2 

5 

4.2.4 

MINIMUM POLY OVERLAP OF 




GUARD RING 

.25 

6 

4.1.1 

MINIMUM POLY TO P-WELL WHEN 




POLY DEFINES P-4- DRAIN 

.9 

22 

4.1.2 

SAME AS 4.1.1 EXCEPT POLY 




DEFINES P+ SOURCE 

.7 

18 

LAYER 5: 

N+ IMPLANT (ALIGNED TO THICK OXIDE) 



5.5.1 

MINIMUM WIDTH 

.2 

5 

5.5.2 

MINIMUM SPACE 

.2 

6 

5.2.1 

MINIMUM N+ TO P+ GUARD 

.4 

10 

LAYER 6: 

P+ IMPLANT (ALIGNED TO THICK OXIDE) 



6.6.1 

MINIMUM WIDTH 

.2 

5 

6.6.2 

MINIMUM SPACE 

.5 

12 

6.2.1 

MINIMUM P+ DRAIN TO 




P+ GUARD 

.75 

19 

6.2.2 

MINIMUM P+ SOURCE TO 




P+ GUARD @ VDD 

.6 

15 

6.2.3 

MINIMUM P+ DRAIN TO 




P+ GUARD 0 VDD 

.6 

15 

6.3.1 

MINIMUM P+ TO THICK OXIDE 




WHICH DEFINES P+ 

.2 

5 

6.1.1 

MINIMUM P+ TO P-WELL 

.9 

23 

6.1.2 

MINIMUM P+ TO P-WELL 




0 VDD 

.75 

19 

6.5.1 

MINIMUM P+ OVERLAP OF 




N+ FOR SHORTING 

.05 

1 


Table “(Continued) . NASA Design Rules For Sandla Process 
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riASK 

description 

NASA 

(MILS) 

SANDIA 

(MICRONS) 

LAYER 7: 

CONTACT (ALIGNED TO THICK OXIDE) 



7.7.1 

MINIMUM CONTACT 

.3 x .3 

6x6 

7.3.1 

MINIMUM CONTACT INSIDE 




THICK OXIDE DIFFUSION 

.2 

ii 

7.4.1 

MINIMUM CONTACT INSIDE 




POLY TO EDGE 

.15 

4 

7.4.2 

MINIMUM CONTACT OUTSIDE 




POLY TO EDGE 

.2 


7.5.1 

MINIMUM CONTACT INSIDE 




N+ DIFFUSION 

.2 

5 

7.6.1 

MINIMUM CONTACT INSIDE 




P+ DIFFUSION 

.2 

5 

7.3.1 

MINIMUM CONTACT FROM 




OXIDE STEP 

.2 

5 

7.7.2 

MINIMUM CONTACT FOR SHORT- 




ING OVERLAP DIFFUSIONS 

.3 

6 


MINIMUM P+ GUARD CONTACT 

.3 

7 

LAYER 8: 

METAL (ALIGNED TO CONTACT) 



8.8.1 

MINIMUM METAL WIDTH 

.5 

8 

8.8.2 

MINIMUM METAL SPACE 

.3 

7 

8.7.1 

MINIMUM METAL CONTACT 




OVERLAP 

.1 

1 

8.3.1 

MINIMUM METAL TO SCRIBE 




LINE 

2.0 

50 

LAYER 9: 

VIA (ALIGNED TO FIRST METAL) 



9.9.1 

MINIMUM VIA 

. 3 x .3 

6x6 

9.8.1 

MINIMUM VIA INSIDE METAL 

.1 

1 
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MASK 

DESCRIPTION 

NASA 

(MILS) 

SANDIA 

(MICRONS) 

LAYER 10: 

SECOND METAL (ALIGNED TO VIA) 



10.10.1 

MINIMUM METAL WIDTH 

.5 

8 

10.10.2 

MINIMUM METAL SPACE 

.3 

7 

10.9.1 

MINIMUM METAL OVERLAP 




OF VIA 

.1 

1 

10.3.1 

MINIMUM BONDING PAD TO 




THIN OXIDE 

1.5 

40 

10.10.3 

MINIMUM BONDING PAD TO 




SECOND METAL 

1.5 

40 

10.8.1 

MINIMUM BONDING PAD TO 




FIRST METAL 

1.5 

40 

10.10.4 

MINIMUM UNBUFFERED 1 




PAD SPACING 

8.5 

220 

10.10.5 

MINIMUM BUFFERED PAD 




TO PAD SPACING 

12.5 

320 

10.10.6 

MINIMUM PAD SIZE 

4x4 


10.10.7 

MINIMUM VSS & VSS BUS 




WIDTH 

.6 

16 

10.3.1 

MINIMUM METAL TO 


e a 


SCRIBE LINE 

2.0 

50 

10.11.1 

PAD MASK INSIDE METAL PAD 

.175 

4 
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A generalised bit-slioe structure with a 
finite-state maohine controller is visualised to offer the 
flexibility objectives of this work. Mead and Conway 
suggest such an approach in their design test. However, 
their circuit philosophy ‘s based on silicon gate NMOS with 
depletion devioes -- a technology not suitable for the 
radiation environment of space flights. Their 
implementation of logio by means of r eg i s t e r - t o-r eg i s t er 
transfer paths through combination logio with each register 
being gated with a particular olcok phase is not new; 
however, their eitensive use of pass transistor logio to 
implement the combinational logio is unusual. Ratioed logio 
is principally restricted to the latohes and registers; the 
ratioless pass transistor logio achieves the extended AND 
function at essentially sero stand-by power with the only 
performance compromise being fan-in. In this case the pass 
transistors are bi-lateral switohes supplying current as a 
source-follower and sinking current as common source devioe. 
The souroe follower con f i pur a t i on has two shor t -oomings : 

1) Its final output value <pull-up> is limited by 
the body-effect to approximately 3.5 volts from a 
5 volt supply -- even for a high performance NMOS 
pr ooess . 

2) The time for the 0 to 3.5 volt transition is 
approximately five times longer than for the 3.5 
to 0 volt transition (operating as a common source 
dev ice). 

Nevertheless, the simplicity of pass transistor logic is 
very attractive, particularly for VLSI designs. Precharging 
,c high M oan minimise transition delays in many oases. 

Attempting to apply these ooncepts to CMOS 
technologies presents some interesting dilemmas. Replacing 
the ratioed inverters in the latohes and registers by CMOS 
inverters is a straight-forward decision; however, the 
"structured" combinational logio poses a problem. Pass 
transistor switches, historically, in CMOS are implemented 
with pairs of devioes, one NMOS and one PMOS for each 
switch, in order to avo * d using a devioe in its compromised 
s ou r ce - f o 1 1 owe r configuration. A bulk CMOS process with the 
T1MOS devioes built in a p-well, suoh as the Sandia process, 
provides NMOS devioes with poor device characteristics 
compared with standard NMOS devioes. The high p-well doping 
in the Sandia process (higher than normal for radiation 
hardness) produces a greater body-effeoi and reduced channel 
mobility compared with the standard. Consequently, the 
Sandia NMOS device is particularly unsuitable as a 
source-follower. As a result a direct implementation of the 
pass-transistor logio utilised by Mead and Conway is not 
oraotioal in this design effort. 
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On the other hand classical CMOS is not praotioal 
for VLSI either. The CMOS devioe oount for combinational 
logio is approximately twice that for NMOS. Further, 
implementing pass-transistor logio with classioal CMOS 
requires true and compliment signals to drive the NMOS and 
FMOS pair -- thus requiring roughly twioe the number of 
oontrol signals and drivers as for NMOS. 

One alternative approach remains: implement 
pass- t r ans i s t or logic with FMOS only, utilising the faot 
that the FMOS devioes in the Sandia prooess have a 
relatively low body-effect. In addition the P+ guard ring 
diffusion can be used as a diffusion tunnel under poly 
in t er oonneo t i ons to provide a much needed oross-under 
oa pab i 1 i t y . 

Consider a CMOS latch with a FMOS pass - t r ans i s t o r 
serving ? a multiplexing switch. The FMOS pass-gate will 
provide i output swing from about 1.3 volts to the power 
supply. Adjusting the ratio of the first inverter in the 
latch can provide the neoessary level shifting. However, an 
additional NMOS devioe is required with the PMOS feedback 
transistor to supply the full voltage swing after 
M latoh-up", thus providing the full gate drive signals, 
facilitating the full output drive from the inverters. 

In this design philosophy the NMOS transistors, 
whioh reouire the extra area for channel stops, are utilized 
only where neoessary. The NMOS devioe is an effective 
switch in the oommon-s our oe configuration; however, a PMOS 
device is required for pull-up either in a classical CMOS 
fashion or precharged in a docked scheme. 
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FIGURE 2 a - CMOS INVERTER 



FIGURE 2t - THREE TYPES OF SWITCHES 
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The Da t « Pftth Cirouitry 


following tho philosophy of Mood and Conw*y tho data 
path is organitod as parallol bit-slioes from on# port to 
anothor port at illustrated in Figure 6. Two difforont 
bust* run botwoon tho two I/O porta and paaa through tho 
dual port rogiotor array, tho barrol shifter, and tho ALU. 
Tho ALU la built around a Manohoa tor-typo oarry oirouit. 
Tho carry-out signal la proohargod high prior to logio 
evaluation (phaso 2) and roaaint high union tho kill signal 
(K) is truo or F*Cin* is truo whoro F is tho pass signal and 
Cin 1 is compliment of tho carry-in signal. 

Gout ' • K * F*Cin' 

Tho kill and pass signals aro gonoratod by gonoralisod 
function blocks, Figuro 7. Eaoh blook oontains four oontrol 

signals and is driven by truo and ooapl iaont signals from 

two data souroos, e.g.. 

Gout » GO * A 1 *B * ♦ G1 *A * *8 ♦ G2^A*B* ♦ G3*A*B 

whoro A 4 8 aro data signals and GO - G3 aro oontrol signal* 
dotornlnod by tho i n t or pr o t a t i on of tho instruction. 

Tho ALU is illustratod in Figuro 6. Tho oarry ohain 
oireuitry is implemented in KUOS sinoo it is proohargod high 
during phaso 1 and conditionally discharged low during phaso 
2. Tho NMO0 devioe is faster in this mods than tho FMOS 

device . In addition tho X and P funotion blooks oan bo 

proohargod low during phaso 1 and conditionally ehargod high 
during tho evaluation of tho logio, phaso 2 -- thus 

achieving a r i pp 1 e- t hr ough offoot and saving one signal 

inversion and two logio gates. 

Tho ALU is preconditioned for logio evaluation 
(proohargod) during phaso 1 and tho logio is evaluated 

during phaso 2. Tho data is transferred over tho buses 
during phaso 1 with tho buses proohargod low during phaso 2. 
Tho two input latohes, A and B, have multiplexing gates 
whioh select signals from throe souroos: 1) For latoh A tho 

possible signals aro Bus A, tho shifter output, and tho 

shi f t -oont r :> 1 signal. 2) For latoh B tho possible signals 

aro Bus B, tho shifter output, and tho shift oontrol signal. 
Those signals aro solootod during phaso 1 and latohod during 
phaso 2. Tho two output latohes sample only tho output of 
tho It blook during phaso 2. Either output latoh oan drive 
either bus during phaso 1. 

Tho oarry-ohain output drivos tho oarry input for 
tho next bit-slioe. After several bit-slioes aro added 
tooother, tho disohargo delay through tho several 
pass- t r ans i s t or s can Looome oxoossivoly long. By utilising 
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thi Cin signal generated to drive ths It block as th# drive 
signal (or ths oarry propagation gats on a parlodio basis, 
tha pass-gate logio is buffered to minimise tha oarry 
propagation dalay. 

Ths data busas ars praohargad low during phasa 2 and 
conditionally ohargad high through tha anablad PMOB 
pass-gata as illustrated in Figure 12. Tha two port 
ragistar oall with its PMOB drivers to either bus is shown 
in Figure 10. 

Between tha ragistar and tha ALU is a barrel shl(tor 
whloh oonostenaies tha two buses with bus B being in tha 
lower significant bit po itlons. With tha shift oonstant 
equal to saro tha shifter output oorrasponds to the A bus. 
With a non-ssro shift oonstant, * h a output is shifted down 
tha eorrasponding number of bits. 'he most significant bits 
from the A bus are omitted and tha most significant bits 
of the B bus are inoluded. Figure 11 illustrates the 
pass-gate logio used in a 4 s 4 bit barrel shifter. For the 
Bandia process tha shifter is implemented in PMOB with the 
shift output praenargad low in phasa 2 If data is read out 
of a register to an input latch, the transfer ooours in 
phase 1 through three PMOB pass-gates: a dual port read 

switoh, % shifter switch, and the multlpleser at the latch 
input . 

The shifter also aooomodates data transfers between 
bos A and a literal -port to the controller. The controller 
oan supply a literal in phasa 1. Data transfer in the 
reverse direction is possible in the opposite phase. Figure 
12 illustrates the bi-lateral buffer. 

A tri-state I/O buffer is given in Figures 14 and 
IS. Tha output is latohed from either bus and enabled with 
an asynchronous signal with the latoh driving either bus 
during phase 1. An appropriate number of inverter stages is 
inoluded in the buffer to aohieve the necessary oapaeitance 
driving capability. 
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V. Additional Comments 

This design of ths functional slonsnts for a data 
path logio provides ths foundation for implementing a 
mi oropr ooessor with the MTA oonoept. In this regard either 
the struotured SLA desoribed in another part of this report 
and/or oonventlonal STAR oalls oan be used to implement the 
oont rol loglo. 

Th e data path or bit-slioe approaoh is quite 
flexible and is compatible with the oonoept of 
mioroprogr amminq with a PLA type struoture. Eleotrloally 
reprogrammable PLAs oould even provide field ohanges in the 
basio Instruction. The popularity of the bit-slice approaoh 
is exemplified by reoent announcements from Hewl et t -Paokard 
and Bell Laboratories. It should also be noted that the 
Bellmao32 from Bell Laboratories is a 32 bit prooessor 
implemented with CMOS whioh uses their poly cell (standard 
cells) structures in oonlunot ion with a data path funotion. 

The detail drawings of the functional oells for the 
data path are provided in the appendix. 
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STORED LOGIC ARRAYS - SLA* 
IMPLEMENTED WITH 
CLOCKED CMOS 


SUMMARY 


Stored loflio array* are folded PLA* with tha AND 
and OR plana* merged into ona physioal spaoa. Tha 
struotura is shown to ba oompa tibia with tha STAR layout of 
transistor*. As a rasult tha STAR foundation wafars, or 
similar struotura, oan ba usad to implamant tha struoturad 
loqic assooiatad with PLAs . This struotura offars tha 
potantial for iaplamantino tha control struotura for tha 
Mosiao Transistor Array. In addition tha struotura offars 
an organixad approach for tha distribution of logio gatas 
in state nachinas and as such may raprasant an approach to 
untangling tha plaoamant and routing difficultias in random 
logio . 

In ordar to afficiantlv oompact tha logic onto tha 
ragular grid of tha STAR, it is nacassary to davalop a naw 
stylo of CMOS oirouitry whioh doas not utilise tha olassio 
CMOS pair of transistors for aaoh fan-in. A clookad 
variatv of CMOS is davalooad which aohiavas highar layout 
dansity and parfornanct . Concapts of s imu 1 t ana ou s 1 y 
praoharqing a sarias of gatas ara davalopad . At an 
appropriata clocking signal tha logic is avaluatad with tha 
signals >, rippling >l through tha logic. This ooncapt has lad 
tha author to tha tars "rippit u logic. It is shown that 
tha logio is not general; oonsaquan t 1 y , tha oompromisas ara 
davalopad ralativa to high parformanca, ragular logio 
arrays. Sinoa dynamic techniques ara utilisad, oaraful 
oons i dar a t i ons of oharga splitting ara observed. As such, 
olookad CMOS concepts ara prasantad in a form suitabla for 
tha uninitiated readers. 
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STORED LOCIC ARRAYS - SLAs 
IMPLEMENTED WITH 
CLOCKED CMOS 


I. Introduction 

Gat# arrays have btooni inortu inoly popular in th# 
last f #w y#ars as a naans of implementing ooapltv logio 
with roduotd engineering cost. Th# NASA STAR is a 
double-layer netal CMOS version of a gate array. The STAR 
has at least two features whioh offer consideration of an 
alternate design aporoaoh: 1) The STAR is composed of rows 
of NMOS transistors and rows of PMOS transistors with the 
neighboring transistors always ooupled via the souroe/drain 
diffusion. 2) The gates of the transistors are left 
unoonneoted through the standard layers and are conneoted 
with the or og ramming mask levels. Array logio suoh as 
programmable logio arrays (PLAs) or storage logio arrays 
<SLAs) offer an organised means of placement and routing 
for logio with some sacrifice in performance. 

Classic CMOS circuitry reauires signals to drive 
pairs of transistors instead of one, as normally used in 
NMOS. This requires greater area for interconnections and 
transistors. However, olookod CMOS, with +he ooncept of 
precharging oirouit nodes and then conditionally 
discharging them, requires only one transistor per signal 
input. Furthermore, it offers the potential for speed 
enhancement. A requirement for this oirouit style is that 
the gates for NMOS and PMOS pair transistors remain 
unoommitted, a feature found in the STAR design. 

An SLA is a particular variety of PLA whioh is 
folded with the OR and AND planes merged together. The 
latohes ( or flip-flops ) can be plaoed around the logio 
array. The actual sise of the AND and OR gates for 
particular signals oan vary with the actual applications, 
thus providing folding opportunities not normally found in 
PLAs. The STAR design offers the potential of implementing 
the SLA on its standard structure. The SLA design can be 
part of a STAR chip design and merged ir. with conventional 
gate implementations. 

With these oonoepts in mind the SLA design is 

pursued . 
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! I . Clooktd CMOS 


A. Basio ConoftDt 

There irt »«vir» 1 olooktd CMOS oiroult styles. All 
are based on the oonoept of preoharged Batts. Defining an 
N-Gate as a olooksd CMOS gats whioh utilises NMOS 
transistors for logio evaluation, ons simply usu a FMOS 
transistor to prsohargs ths output unconditionally to ths 
positive supply cotsntial during sons prsohargs olook 
signal and conditionally disohargss ths output back to 
eround through a s«r iss-oonnsotsd MHOS transistor whioh is 
switohsd on with an evaluation olook signal. If ths 
ss r 1 ss / par a 1 1 s 1 combination of NMOS transistors whioh form 
ths logio funotion offers a path to ground, ths output is 
discharged during ths evaluation period. If ths logio 
dsvioss do not offer a oonduotion path or ths evaluation 
dsvios is not turned on, ths output remains in its 
prsohargsd stats subject to ths leakage discharge of ths 
node. This leakaae time oonstant oan be eipeoted to exceed 
1 mseo with tvpioal values being in the order of one 
second. This information associated with the oharged state 
of ths output is dynamio in nature and requires sampling 
with approoriate timing constraints. One can realise ths 
logio function of the gate by noting ths condition 
associated with discharge is producing a M 0" , for positive 
looic. If two devices are in series, both must be on for 
discharge, resulting in the NAND function being realised. 

If devices are in parallel, either oan discharge the 
output ; hence, the NOR funotion is realised. The more 
general panllel/series arrangement oan be analysed in a 
s imi 1 ar manner . 

The F-Cato is defined in a dual manner to the 
N-Gate. The 1 og i o - f o rming transistors are implemented with 
FMOS devices and the precharqing load devioe is a NMOS 
transistor. The output is unconditionally precharged to 
the "0“ state and conditionally charged to the positive 
supply through the PMOS logic during the evaluation period. 
With series FMOS logio transistors the logio formed is the 
NOR funotion, assuming positive logic. With parallel logio 
transistors the logio formed is the NAND funotion. In 
general, the se r i es- par a 1 1 e l combination of PMOS 
transistors must be the dual of the pa r a 1 1 e 1 -se r i es 
combination of NMOS transistors for a given logic function, 
just as for classical CMOS. 

Consider the following oonoepts illustrated in 
Figure 1: The information in the state flip-flops is 
stored in more-or-less conventional CMOS latohes with 
aotive output drive in both direetions. Looal feedbaok in 
the latoh provides indefinite retention of the information. 
The clooked P-Gates and N-Cates are oreoharged to their 
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clocked cmos 



n-gate def initioning p-gate 

concept 



standby lnforastlon la storsd m atatle t/f *• and olooked gates are preoharged 
evaluation takes pleas in one olook period in a ripple through banner 


FIGURE- 1 CLOCKED CMOS CONCEPT 
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appropriate levels during t ho prooharpo 
or s+and-by period, a period whioh oan last an indefinite 
t iae sinoe the state information is stored in statio 
latohes. N-Catas drive F-Cates and vice-versa. Similar 
pates never drive eaoh other, e.g., N-Gates never drive 
H-Cates. One observes that the olooked pate inputs, if 
driven from an opposite type, are preoharged to the 
oondition whloh turns the lopio devioes off. Consequently, 
no ooaplete conduction path oan exists between the power 
supply and ground in this preoharged state, even if the 
evaluation devioe is removed. The pates driven dlreotly 
from the state flip-flops, however, do not neosssarily have 
their loglo devioes turned off; oonsequen t 1 y , evaluation 
devioes are reauired for these stages in order to achieve 
near sero stand-by power. In principle: a string of 

olooked gates of alternating types oan be preoharged during 
the stand-by period; the preoharge oiooks oan be removed; 
the evaluation signals oan be applied to the state 
flip-flop driven stages; logic signals can then propagate 
through the string in a ripple-through manner limited only 
bv the transition times of the individual stages. 

I . Obser va t i ons 

Timing signals are reouired for the preoharge 
(stand-by) phase, the logic evaluation phase, ind the lateh 
enable phase. No additional timing oiooks are required 
un like the case of olooked, non-ratioed, single-polarity 
circuitry with multiple logic phases. The ideal timing 
oould be aohieved with a single svstem olock . For example, 
the svstem olook oould represent the stand-by phase when 
the olook is low; when the clock goes high, the evaluation 
phase begins and oontinues until the olook returns to the 
low state whioh enables the flip-flops (edge triggered). 
Dynamio information is used during the evaluation phase; 
consequently, there is an upper limit on the pulse width 
for the system olook of about 1 ssto . 

The propagation delay for the r i pp 1 e- thr ough 
olooked CMOS should be faster than for olassioal CMOS or 
olooked s ing le-oo 1 ar 1 1 y oirouitry The discharge delays 
are aohieved through oommon-sour oo configured FCTs , the 
fastest type, wit** only single transistor loading per 
fan-out. Eaoh transition delay begins when the input 
signal reaohes the devioe threshold instead of the circuit 
threshold of ratioed or elassioal CMOS cirouitry. There is 
no added delay assooiated with additional evaluation oiooks 
as in the oase of single-polarity docked non-ratioed 
oirouitry. 

Classioel CMOS gates or statio gates oan be used in 
Dlaoe of olooked gates; however, the preoharged levels must 
be oompatible as before. This may be beneficial sinoe the 
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observations 


atatio oirouita oan be introduced, however tha preoherped etapee auat 
beooapatible. (lea etand-by power) 

-exaaple 



0 


y 


i ■* 

tha p-pate oould ba atatio 

ita inputs ara preoherped high by tha n-pata outputs 
tta output Mill ba preeharped low 

- y, 

ita output can only drive n-patss 

addinp extra olooka provides aora flaxibility at tha oaorifioa in 
aaynonrenoua bahavior 

FIGURE-2 CLOCKED CMOS OBSERVATIONS 
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olisiio oitt supplies lotivi drive from both directions, 
although compromising input loading. 

It should be noted that the output of any stage can 
drive any ‘’downstream 11 stage of the opposite type. As an 
example, let the first stage from the flip-flops be stage 1 
and of N-Type. The P-Gates then fall on the even number of 
stages and any of them can be driven from stage 1. 

Excra evaluation docks can be added for more 
flexibility although asynchronous behavior is sacrificed. 
The added flexibility may be desired in order to achieve a 
more general form of logio funotion. Driving an N-Gate 
witn another N-Gate is an example. This subject is dealt 
with in more detail in the section implementing the PLA 
logic f unc t i on . 

Figure 3 illustrates a ripple realisation of the 
AND-GR function needed in PLAs . The first stages are 

N-Gates, series NMOS to provide the NAND function. The 
seoond stages are P-Gates, paralleled PMOS also to provide 
the NAND. The NAND-NAND is one implementation of the 
AND-OR . The output of the second stage is precharged low 
and conditionally charged high during the evaluation phase. 


One notes that an entire NAND -OR funotion can be 
imolemented in the first N-Gate staqe by paralleling 
(providing the OR funotion) additional disoharge paths to 
ground through series devices (providing the AND function). 
A consequence of this two-level gate is that the implicants 
(the AND terms) oannot be shared with other OR gates. 
Another consideration is the comparative delay associated 
with ser i es- oonneo t ed gates having extended fan-in (F/I). 
Larqe PLAs require large F/I and, preferably, the 
implicants can be shared. As a consequenoe, parallel 
configured gates are explored next. 

C. Basic Types of P a r a 1 1 e 1 -Conne o t e d CMOS Gates 


Figure illustrates five different versions of 

clocked CMOS which utilise parallel connected NMOS 

logio devices. The classic CMOS gate is also included for 
illustrative purposes. There are two families of docked 
CMOS: the ripple variety without the evaluation device and 

the gated variety which responses to an evaluation signal. 
Dual versions exist in the P-Gate type, providing a total 
of ten versions of clocked CMOS gates using 

p ar a 1 1 e 1 -conne c t s d logic devices. 

The first of the ripple varieties utilises 
common-sour ce logio devices and is labeled NR i which 
indicates N-Gate, Ripple, and inverting output. Assuming 
the N-Gate type, the logio function is NOR > the inputs are 
precharged low; and the outputs ars precharged high. This 
gate implementation provides the fastest response of all 
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the •valuation davloa in tna p-gate oan be raaovad 

-if all of tna lnputa are praohargad nigh by n-gatae 

the logie gataa 'downstrsaa* oan only discharge in ona direction 
there oannot bo a raoa condition 

logie daoiaiona are propagated aaynohronously during the svolkuation ties 
oan drive the p-oata but 
-an evaluation davloa la required 

the legate at stage 1 oan drive p-gataa at atagaa 8. 4. 6. etc. 
observe: the series n-gate. nand function: the parallel r -gate, nsnd 
function. 

a parallel-aerlsa ooablnatlon at one stage also provides and-or funotlons 


FIGURE-3 AND-OR WITH RIPPLE CMOS 
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t he viri ions . 

The stoond of the ripplo varieties utilises 
louroft-fol lower configured lopio devioes, whioh results in 
a non-inverted output, and is labeled NRn . For the N-Gate 
the input and outputs are preoharged low; the logio 

funotion is OR. Since the output voltage swing is not all 
the way to the supply ootential -- limited by a threshold 
voltage below the rail — this version is generally not 
useful and only used in speoial circumstances. Since the 
body-effect cn the threshold voltage lowers even further 
the output swing, this version is especially dubious for 
the logic devices fabricated in "wells" , suoh as NMOS in a 
P-wtll process. This variety is approximately an order of 
magnitude slower than the inverting version -- even to 
reach its limited output swing. As an example, for a 
F-well prooess and with a 5 volt supply, the output swing 
is approximately from ground to 2.5 - 3 volts. Requiring 
the output to swing only over a limited range, near ground 
fcr the ft-Gate, improves its usefulness. 

The first and third of the gated versions utilise 
common -sou r c e devices and provide the NOR logic funotion. 
Both provide full r a i 1 - 1 o - r a i 1 signal swings; the outputs 
are precharged high; and the inputs need not be precharged 
The difference between the two relates to the location of 
the evaluation device. 

The label for the NGa version refers to the Gated 
varietv and the fact that its inputs must be driven from an 
"active" sourcing output to prevent oharge splitting. Fcr 

example, as illustrated in Figure 5, the outputs driving 

this gate must be able to supply current from the positive 
supply for a "1" input when the evaluation device is turned 
on. During preoharge the output of the NGa gate is 
precharged high. One input is presumed to be precharged 
high, resulting in the (common) sources of the logic 

deviees being preoharged high to within one threshold below 
the supply voltage. The channel is only weakly induced in 
this case. When the evaluation device is turned on, the 
source node is pulled to ground with coupling through the 
full gate oapaoitanoe to the input nodes. NOT# that if 
there is no aotive source for charging this coupling 

capacitance, then the input signal level is pulled toward 
ground, reducing the effective drive of the input. The 
amount of this signal loss depends on the effective node 
capacitance of the input and the coupling capacitance of 
the gate. A precharged output for a simple inverter with a 
Urqe fan-out requirement would lose significant signal. A 
classio inverter or a P-Gate can supply the necessary 
current to provide the full input drive voltage. One also 
notes that the evaluation devioe can be shared with other 
NGa gates. 

The third version of the gated variety, the NGc 
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version, has Its evaluation dtvict bitwian tha output nod* 
and tho oommon drains of tho loose dovioes. Tho "e" riftti 
to tha output requirement of driving a large oapaoitanoa 
load whan tha fan-in is larqa to minimis# charge splitting. 
Vhan tha praoharga phasa is present, tha output is 

praoharqad hiqh and tha oommon drains of tha logio davloas 
ara prasumad to ba disoharqad low. If tha inputs ara 
subsaquantly disohargad to ground and than tha avaluation 
davioe is turnad on, only tha output oapaoitanoa is 
availabla to supply oharga *n ordar to oharga tha oommon 
drain noda. Tha rasulting oharga splitting may raduoa tha 
output signal to unaooaotabla lavals. For example, if tha 
output is driving a simpla invartar (minimum fan-out) and 
tha fan-in is larqa, than tha affaotiva oapaoitanoa on tha 
oommon drain noda may wall dominata tha load oapaoitanoa, 
loading to affaotivaly no "1" output. 

Anothar varsion of tha gatad variatias of clooked 
gatas is labalad NCb for “bootstrap" from tha natura of tha 
cirouit aotion. Tha souroas ara tied to tha avaluation 
clock wh i oh is at ground during praoharga. Tha drains of 
tha logic and load davloas ara oonnaotad to tha output. 
Sinoa tha praoharga olock is high during praoharga, tha 
output is disohargad low durinq praoharga through tha load. 
Tha inout signals ara fad through tha sarias davicas during 
praoharga to the gates of the logio devioec. At tha and of 
tha praoharga phasa tha sarias sampling davioes ara turnad 
off, leaving tha logio davioa gates charged to soma state. 

If all of tha inputs are low, than no ohannals exists 
between tha avaluation olock and tha output; consequently, 
very little ooupling exists between tha avaluation olock 
and the logio gates. When tha avaluation elook goes high, 
tha cutout remains low, at qround. On the other hand, if 
any one of tha gatas is charged whan tha avaluation olock 
qoe hiqh, there is a strong coupling between tha 
avaluation olock and tha input through the gate oapaoitanoa 
which bootstraps the gate up with the rising evaluation 
elook. Tha channel of tha high input remains fully 

conductive, and sources current from tha avaluation clock 
to the output to charge it high -- even to the supply 
voltage. If any one of the inputs is sampled high prior to 
the evaluation olock, tha output is subsaquantly driven 
hiqh, generating the OR function. Tha input sampling 
davioes provide isolation between the charged gatas of tha 
logic devices and the outputs of tho previous stages; 
however, they may constitute an unaooaptabla addition to 
tha circuitry. If they ara delated and tha inputs ara 
driven from PMOS devices, than tha bootstrap aotion drives 
tha drain junction of tha PMOS into forward bias with its 
associated injection of holes — a sura method to lead to 
SCR action and latoh-up. Consequently, this version of 
gated CMOS is likely to have limited utility in bulk 
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.sv.i prooharge 
phase e-0 

output discharged low thru phase e 

input signal? transferred to the logic device gate? 

transfer devices are turned off after precharge 

e e— high for evaluation 

if all cf the inputs were lew, there is little oapaoltar 
coupling Cc to the storage node 

if one of the inputs ir high a channel ir forced and Co 
the output is driven high with bootstrap action 
"or" function for n-p ate 
"and* function for p-gate 


i* the r.-tyce transfer de 'ice ir not used and a piuor ir 
the storage circuit, then the px*or drain dioce ir driven 
forward tier with bootstrap action, 
the result ir latch up — 
unless? one of the folio 'ing 
i.' use nruor transfer gate 
c..) use substrate bias or 
S.) ufo 303 cr equivalent 


FIGURE 6 - CLOCKED CMOS GATE 
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prootiits The method* to provide isolation to prevent 
1 a t eh-up , eg., substrate bias or th# inclusion of th# 
s#ries sampling devio#s, are likely to result in an 
impraotical pat# imp 1 im#n t a t i on . For fundamentally 
isolated structures such as 508 or las#r annealed poly over 
oxide, the bootstrap pate offers future consideration 
because of its non- i nver t i np logic functions with 
reasonable speed. 

With these possible candidates for logic gates the 
question remains: Which of the many forms represents the 
best implementation for PLAs using the CMOS technology? 

III. Programmable Logic Arrays 
A. The PLA General Function 


Th* typioal blook diagram of a PLA implemented in 
NMOS is presented in Figure 7, Latches with sampling at 
phase 1 provide the complimentary signals and temporary 
storage during the evaluation period. The AND plane, as 
illustrated, consist of ratioed NMOS in NOR gate form. The 
OR plane consist also of ratioed NOR gates. One notes that 
complimentary signals are used as inputs and outputs. The 
output of the OR plane feeds the output inverting buffers 
(latohes) which are enabled with phase 2. Since ratioed 
logic is used, the evaluation period overlaps phase 1 and 
phase 2 . 


Clocked CMOS PLA Timing Concept 


A master 
events, although 
from the master . 


clcok is hypothesised which controls four 
other clocks may be required and derived 


1) Static CMOS latohes are used with data docked 
in at the trailing edge of the master clook. 

2) The PLA is precharged in the stand-by state 
(clock low) to facilitate fast logic evaluation. 

3) At th* rising edge of the master clock the 
"state" and control inputs are gated into the PLA. 

4) The signals "ripple' 1 through the AND-OR planes 
while the clock is high. 


Another important assumption is that the 

requirements are large which implies parallel gates. 


fan-in 


C. Ripple AND-OR Planes with Parallel Devioes 


Figure 8 illustrates the sixteen possible basio 
functional oombinations which provide the equivalent AND-OR 
function. The task is to select from the possible twenty 
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olooktd , piril U1 CMOS pit** which cm be ui«d in these 
lixtttn btslo functional combinations The OR plana ia 
aitunad to ba of tha rippla variaty ; what ara tha 
pots 1 b i 1 i t i aa t Without regard to tha polarity of tha 

output, which oan ba handled in tha latchaa, four baaio 
combination* ara poaaibla: 

1) NOR 2> OR 3) NOT-NAND 4) NOT- AND 

From tha rippla varaiona for thaaa functions ona obaarvaa 
that all raquira that thair inputs ba praoharga low and 

conditionally ohargad hioh during avaluation. How can tha 
AND plana ba implamantad from aithar tha gatad or rippla 
varieties? This t ina tha polarity of tha inputs ara of no 
oonoarn. Ona finds tha following posa i b i 1 1 t i aa : 

1) N AND -NOT 2 > AND 3) NOR 4) OR -NOT 

Ona observes that all "AND" outputs must ba praohargad high 
and than ba conditionally discharged low in contradiction 
to tha OR plana input requirements. Tha assumed objectives 
ara not poaaibla. 

It should ba no surprise that tha same conclusion 
is reached if an OR-AND funotion is sought. All possible 
outputs for the OR plana ara required to ba praohargad low 
in contradiction to tha possible inputs for tha AND 
plana. Hence, the conclusion is that no possible 

configuration of oarallel devices can ba realised in a 
non-ratioed r i pp 1 a- 1 hr ouqh form. Tha alternatives ara to 
cons i da r : 


1) Series forms 2) Ratioad forms 3) Gating tha OR plana 

Tha last appears to bast satisfy tha requirements of high 
performance for large arrays at low power. 

D. Gatad AND-OR with Parallel Davioas 

Of tha sixteen possible combinations twelve oan ba 
eliminated if tha bootstrap gates ara disallowed due to 
latoh-up and tha sou r o a - f o 1 1 owe r configurations ara 

disallowed due to spaed arguments. Tha four remaining 

fo 1 low; 

1) NAND-NOT-NOT-NAND 2) NOR-NOR 
3 > NAND-NOT-NOR 4 > NOR-NOT-NAND 

Tha first has too many levels of logic to ba practical. 
Tha NOR-NOR version will result in unaooaptabla oharga 
splitting Consider that tha OR plana is lightly loaded 
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vinot it drives simply Utohit . This implies thst ths OR 
plans NOR must not bs of th# KCo version. Ths remaining 
viriion NGa rsouirss an aotivs souros from ths prsvious 
stags, not achievable with olooksd N-Cates. As a rssult 
only ths last two oombinations naad furthsr cons i da r a t i on . 
Ons notss that ths oandidatss ars really ths AOX and 0A1 
functions, ths duals of saoh other. This duality allows 
one to develop a design in either version and translate 
that design into the other by using the duality prinoipls. 
One also observes that saoh oontains a NMOS plane and * 
FMOS p 1 ane . 

Consider the KANO -NOT -NOR version for further 
study. Ths second level of gating can ooour at either the 
inverter or the N-Gate. Both gates which make up the logio 
planes are driving minimum capacitance and have large 
fan-in, thus requiring the '‘a" gate versions. Ths 
resulting ohoioes are then reduced to the following: 

(1) (PGa or PR i ) - < o 1 ass i © inverter )-NGa 
< 2 ) ( PCa or PRi)-FGc-NRi 

Ths gated inverter in ths seoond version is assumed to be 
limited to the "o” vsrsion beoause no active drive is 
available from the previous stage. The PCo gate requires a 
large output load capacitance If this is not ths ease in 
ths minimum fan-out oi ons, then unaooeptable oharge 
splitting may result. Possibly in some designs the 
resulting charge splitting is less using ths PCa inverter 
since the output of the first stage is likely to have a 
high output capaoitanoe. These compromises in oharge 
splitting relative to particular designs and programming 
lead to the oonolusion that ths classio inverter is a far 
safer approach in spite of its possibly slower performance. 
For this reason the olassio inverter approaoh has been 
selected for this design effort. Figure 9 illustrates this 
approach schema t i ca 1 1 y . 

E. Input Partitioning 

ANDinp together two signals before entering the AND 
plane is sometimes used as a means cf compacting the 
design. This partitioning is considered as a programming 

option. The inputs to the PLA are assumed to run globally 

around the ohip and be subjeot to noise. Adding a buffer 
inverter inproves the signal level for sampling which 
ocours during the preoharge state. The Information is 
stored as oharge on oapaoitance <gate) during the 

evaluation state. Double inverters provide the neoessary 
aotive drive for the true and oompliment signals. The 
input partitioning can be aohieved with NOR gates 

implimentec as NCa versions whioh supply the evaluation 
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gating (or t he AND plan* o£ * ha PLA . Th* AND plan* oan th*n 
b* implemented with rippl* gat** PR1 (or higher 

performance. This approaoh pin* alternate* ar* illustrated 
in Figure 10. In sera* designs where th* input oapaoitano* 
to the AND plan* is assured to be su((ioi*ntly large, NGs 
gates oan be used with the associated elimination o( one 
input inverter . 

F. Latohes 

Figure 11 illustrates various versions o( latches. 
The switoh notation with a N, P, or C indioates a 

transmission gat* iaplimented with a NMOS transistor, a 
PMOS transistor, or both as in olassio CMOS. The olassio 
CMOS transnission gat* utilises th* oosaon-souroe operation 
(or both devices to provide rail to rail ooupling whereas 
the single polarity types sutler a threshold drop in signal 
when ooupling a* souroe followers. 

Th* classic NMOS version o( a latoh is indioated 
(or re(ereno*. It th* CMOS version is implemented with 
CMOS transalssion gates, aotiv* drive in both direot ions is 
requited as an input. It the input sanpling devioe 

represents one o( several aultiplexing switohes, then two 
saapling clocks (or eaoh input are required. By adjusting 
the gain o( th* (irst Inverter and utilizing only th* 
transistor not (abrioated in th* "well", on* o( the 
compl iaent saapling olooks oan be eliainated. 

The ooaaon CMOS saapl e-and-ho 1 d oirouit, sometimes 
referred to as the "H" latoh is also illustrated. This 
version sutlers also in not providing symaetrio drive ( r oa 
th* two outputs. A major advantage o( this type, however, 
is that the input impedance is only oapaoitano*. Th* input 
signal oan be ehargo on this oapaoitano* at th* time o( 
enabling th* latoh unlike th* (irst versions. 

A simplitled version ot th* "H" is illustrated 
whioh utilizes th* faot that th* input is preoharged high. 
At phase 2 the input is sampled and enables the latoh. Th* 
(eedbaok path is illustrated as a high impedano* path in 
order to minimize the olooks required. Th* drive from the 
input sampling inverter aust over power th* (eedbaok in 
this oase . Alter the signal propagates through th* two 
inverters, th* lull logio levels ar* generated with near 
zero stand-by power achieved. It only temporary storage o( 
int oraa t ion is required, then the (eedbaok is not required 
at all. 

A dual input (ora ot a latoh is also shown. This 
version requires suttioient ratios ot th* pull-down devioes 
compered with the Internal FMOB devioes such that th* 
internal latoh oan be upset. As before, after th* input 
signal h«s propagated through the Internal latoh. th* logio 
levels ar* fully established and near zero stand-by power 
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is aohi tved . Output dr i v« buffers art included to supply 
adequate souros ourrant , Thors is on advantag* of this 
oonf iguntion: sinoo tho sot tnd rosot transistors art 

turnod off during tho proohargo (stand-by) state, only 4 
low-to-high signal from tho OR piano is required for 
sotting or rosotting, without any olook timing signal 
required for onabling. Tho disadvantago is that two outputs 
from tho OR piano aro roquirod for each flip-flop -- even 
though thoso involvo loss oomplox functions. 

C. PLA with Single-Ended Latohes 

Figuro 12 illustrates a oloekod CMOS PLA with a 
s i ng 1 o -ended latoh. It uses tho PRi-olassio inverter-NCa 
version. It also uses an extra olook with whioh to gate 
tho latoh in order to prevent the glitch in tho latch 
output whioh would ooour if the latoh is enabled at tho 
same time as the OR plane. Sinoo tho outputs aro isolated 
from the PLA inputs during this time by tho input sampling 
dovioes, not shown, this glitoh does not constitute a logic 
error *n the PLA. Tho system requirements may not allow 

it, however. If tho glitoh is not harmful then phase 1* 
can bo morqed with phase 2, saving one dock. 

H. PLA with Dual Input Latches 

Figure 13 illustrates tho PLA/SLA with dual input 
flip-flops. In order to achieve good ratios in the latoh 
tho set transistors aro selected as NMOS with their inputs 
prechargod low. This requires that their inputs aro driven 
by PCa gates. This implies tho dual version of tho PLA, 
i.e., tho OR plane is implemented in PMOS , and tho AHD 

piano is implemented in NMOS. One additional timing event 

is required: the beginning of the stand-by or precharge 

state. A one shot type action is required with the aotion 
triggered by the falling edge of the master olook. 

IV. Star Layouts With Dual Input Latohes 

Figure 14 illustrates a possible chip floor plan 
for a SLA using the attaohed oell designs whioh are based 
on a STAR type foundation. The design shows the output 
buffers around the ohip with an interconnection bus routing 
signals around the ohip Just outside of the buffers. 
Internally in the ohip is the folded array with the AND 
plane made up of parallel NMOS devices oonneoted along 
oolumns (vertioal). The inputs feedin along rows from the 

outside through the output buffsrs and the latohs to the 
input oirouitry. Along eaoh row two signals with their 
oompliments are routed into the AND plane. The implioants 
are generated from oolumns of NMOS devices with their 
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DYNAMIC CMOS PLA 

(with altarnato timing for no output glitch) 



requires another clock signal 


FIGURE-12 
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outputs feeding into two dedicated rows whioh provide ths 
invert tri and the "load* 1 dtviots for ths i&pl loants . Ths 
outputs of ths invsrtsrs ars fsd back down ths ooluan to 
ssrvs as inputs to ths horisontal rows of paralisl 
connsotsd PM08 serving as ths OR plans. Ths outputs of ths 
OR plans run horisontally baok to ths sst and rssst 
transistors of ths latohss. Mi sosl 1 ansous row orisntsd 
dsvioss ars grouped into ths NCOM and FCOM os 1 Is bstwssn 
the input oirouits and ths array itsslf. It should bs 
notsd that ths actual boundariss of ths various gatss ussd 
in ths AND and OR planes oan bs moved to fit ths individual 
requirements for ths logic in question. This is possible 
sinos ths arrays ars back- t c~baok (merged). 

Ths detailed oell designs are included in the 
foil owing figures. 
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